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Abstract. This paper is a brief overview of the concepts involved in 
measuring the degree of contextuality and detecting contextuality in 
systems of binary measurements of a finite number of objects. We dis¬ 
cuss and clarify the main concepts and terminology of the theory called 
“contextuality-by-default,” and then discuss generalizations of the theory 
to arbitrary systems of arbitrary random variables. 
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1 Introduction 

1.1 On the name of the theory 

The name “contextuality-by-default” should not be understood as suggesting 
that any system of measurements is contextual, or contextual unless proven 
otherwise. The systems are contextual or noncontextual depending on certain 
criteria, to be described. The name of the theory reflects a philosophical position 
according to which every random variable’s identity is inherently contextual, i.e., 
it depends on all conditions under which it is recorded, whether or not there is a 
way in which these conditions could affect the random variable physically. Thus, 
in the well-known EPR-Bell paradigm, Alice and Bob are separated by a space¬ 
like interval that prevents Bob’s measurements from being affected by Alice’s 
settings; nevertheless, Bob’s measurements should be labeled by both his own 
setting and by Alice’s setting; and as the latter changes with the former fixed, 
the identity of the random variable representing Bob’s measurement changes 
“by default.” One does not have to ask “why.” Bob’s measurements under two 
different settings by Alice have no probabilisitic relation to each other; they 
possess no joint distribution. Therefore one cannot even meaningfully ask the 
question of whether the two may be “always equal to each other.” The questions 
one can ask meaningfully are all about what joint distributions can be imposed 
(in a well-defined sense) on the system in a way consistent with observations. 
A system is contextual or noncontextual (or, as we say, has a noncontextual 
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description) depending on the answers to these questions. Thus, the famous 
Kochen-Specker demonstration of contextuality is, from this point of view, a 
reductio ad absurdum proof that measurements of a spin in a fixed direction 
made under different conditions cannot be imposed a joint distribution upon, in 
which these measurements would be equal to each other with probability 1. 

1.2 Notation 

We use capital letters A, B,..., Q to denote sets of “objects” (properties, quan¬ 
tities) being measured, and the script letter to denote a collection of such 
sets. We use capital letters i?, S, and T to denote the measurements (random 
variables), and the Gothic letter 91 to denote sets of random variables that do 
not possess a joint distribution. 

2 Contextuality-by-Default at a Glance 

The following is an overview of the main concepts and definitions of the contextuality- 
by-default theory. This is not done at a very high level of generality, in part in 
order not to be too abstract, and in part because the criterion and measure of 
contextuality have been developed in detail only for a broad subclass of finite 
sets of binary measurements. Thus, the notion of a context given below in terms 
of subsets of measured objects is limited, but it is intuitive, and a way to gen¬ 
eralize it is clear (Section [S]). The definition of maximally connected couplings 
is given for binary (±1) measurements only, and reasonable generalizations here 
may not be unique. In Section [S] we discuss one, arguably most straightforward 
way of doing this. 

2.1 Measurements are labeled contextually 

There is a set Q ol’’’'objects" we want to measure. For whatever reason, we cannot 
measure them “ all at once" (the meaning of this is not necessarily chronological, 
as explained in Section 0]). Instead we define a collection of subsets of Q, 

^={AcQ,BcQ,...}, (1) 

and measure the objects “one subset at a time." We call these subsets of objects 
contexts. Different contexts may overlap. This definition has limited applicability, 
and we discuss a general definition in Section O For now we consider only finite 
sets Q (hence finite collections of finite contexts). 

The measurement outcome of each object q (from the set Q) in each context 
C (from the collection to) is a random variable, and we denote it (with 
q € C G This is called contextual labeling of the measurement outcomes. It 
ensures that the collection 

{<},«. - ( 2 ) 

for all A., B,... comprising are pairwise disjoint, as no two random variables 
taken from two different members of the collection have the same superscript 
(whether or not they have the same subscript). 
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2.2 Measurements in different contexts are stochastically unrelated 

We call R'^ = for every C € ^ a bunch (of random variables). The 

random variables within a bunch are jointly distributed, because of which we can 
consider each bunch as a single (multicomponent) random variable, li q, q' € C G 
we can answer questions like “what is the correlation between and R^, ?”. 
However, ii q € C, q' € C', and C ^ C, then we cannot answer such questions: 
Rg and R^, belong to different bunches and do not have a joint distribution. 
We say that they are stochastically unrelated. 

2.3 All possible couplings for all measurements 

Consider now the (necessarily disjoint) union of all bunches 

U U (3) 

ce«’ 

i.e., the set of all measurements contextually labeled. The use of the Gothic font 
is to emphasize that this set is not a multicomponent random variable: except 
within bunches, its components are not jointly distributed. We call CH a system 
(of measurements). 

Now, we can be interested in whether and how one could impose a joint 
distribution on 91. To impose a joint distribution on 91 means to find a set 
of jointly distributed random variables S = {S^} such that, for every 

= = ( 4 ) 

The symbol ^ means “has the same distribution as.” Note that 

S = {*“}«» = {SfW» (5) 

is a single (multicomponent) random variable, and in probability theory S is 
called a coupling for (or of) 91. Any subset of the components of S is its marginal, 
and S'^ is the marginal of S whose components are labeled in the same way as 
are the components of the bunch i?^. 

If no additional constraints are imposed, one can always find a coupling S for 
any union of bunches. For instance, one can always use an independent coupling: 
create a copy of each bunch R^ (i.e., an identically labelled and identically 
distributed set of random variables), and join them so that they are stochastically 
independent. The set S = is then jointly distributed. The existence 

of a coupling per se therefore is not informative. 


2.4 Connections and their couplings 

Let us form, for every object q, a set of random variables 

^9 = {Rq } j 


( 6 ) 
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i.e., all random variables measuring the object q, across all contexts, We call 
this set, which is not a random variable, a connection (for q). Let us adopt the 
convention that if a context C does not contain q, then is not defined and 
does not enter in VfKq. 

A system is called consistently connected if, for every q G Q and any two 
contexts C, C containing < 7 , 

Rq^Rq- ( 7 ) 

Otherwise a system is called (strictly) inconsistently connected. Without the 
adjective “strictly,” inconsistent connectedness means that the equality above is 
not assumed, but it is then not excluded either: consistent connectedness is a 
special case of inconsistent connectedness. 

One possible interpretation of strictly inconsistent connectedness is that the 
conditions under which a context is recorded may physically influence (in some 
cases one could say, “signal to”) the measurements of the context members. An¬ 
other possibility is that a choice of context may introduce biases in how the 
objects are measured and recorded. 


2.5 Maximally connected couplings for binary measurements 

Every coupling S for has a marginal Sq = that forms a coupling for 

the connection We can also take 9lq for a given q in isolation, and consider 
all its couplings Tq = Clearly, the set of all Sq extracted from all 

possible couplings S for is a subset of all possible couplings Tq for 

Let us now conhne the consideration to binary measurements: each random 
variable in the system has value -1-1 or —1. In Section [S] we will discuss possible 
generalizations. 

A coupling Tq for a connection fHg is called maximal if, given the expected 
values {Rq) for all C, the value of 

eq [Tq) = Vr[T^ = I-.C Pr [T^ = -1 : C G ( 8 ) 

is the largest possible among all couplings for d\q (again, R^ and T^ are not 
defined and are not considered if C does not contain q). 

Let us denote 


maxeq(91,j)= max eq (T,). (9) 

all possible 
couplings Tgfor 

It follows from a general theorem mentioned in Section [S] that this quantity is 
well-defined for all systems, i.e., that the supremum of eq (Tq) is attained in some 
coupling Tq. Clearly, for consistently connected systems maxeq(9Iq) = 1 (the 
measurements can be made “perfectly correlated”). For (strictly) inconsistently 
connected systems, maxeq {^q) is always well-defined, and it is less than 1 for 
some q. It may even be zero: for ±I variables this happens when the Dig contains 
two measurements R^ and Rq such that Pr [R^ = l] = 1 and Pr [R^ = l] =0. 
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2.6 Definition of contextuality 

Consider again a coupling S for the entire system and for every q G Q, extract 
from S the marginal Sq that forms a coupling for the connection d\q. 

Central Concept. If, for every q € Q, 

eq (S'q) = maxeq (IHg), (10) 

(i.e., if every marginal Sq in 5" is a maximal coupling for d\q) then the coupling 
S for 91 is said to be maximally connected. 

Intuitively, in this case the measurements can be imposed a joint distribution 
upon in which the measurements of every object q in different contexts C 
are maximally “correlated,” i.e., attain one and the same value with the maximal 
probability allowed by their observed individual distributions (expectations). 

Main Definition. A system 91 is said to be contextual if no coupling S of 
this system is maximally connected. Otherwise, a maximally connected coupling 
of the system (it need not be unique if it exists) is said to be this system’s non- 
contextual description (or, as a terminological variant, maximally noncontextual 
description). 

For consistently connected systems this definition is equivalent to the tra¬ 
ditional understanding of (non)contextuality. According to the latter, a system 
has a noncontextual description if and only if there is a coupling for the mea¬ 
surements labeled noncontextually. The latter means that all random variables 
Rq within a connection are treated as being equal to each other with probability 
1 . 


2.7 Measure and criterion of contextuality 

If (and only if) a system 91 is contextual, then for every coupling S there is at 
least one q € Q such that eq (Sq) < maxeq (91^). This is equivalent to saying 
that a system is contextual if and only if for every coupling S of it. 


(‘^ 9 ) < maxeq (91g). (11) 

q&Q q&Q 


Define 


maxeq (91) 


max 

all couplings 
S for ^ 


<.q^Q 


( 12 ) 


In this definition we assume that this maximum exists, i.e., the supremum of the 
sum on the right is attained in some coupling S. (This is likely to be true for 
all systems with finite Q and binary measurements, but we only have a formal 
proof of this for the cyclic systems considered below.) Then m is equivalent to 


maxeq (91) < X maxeq (91^), 

q&Q 


(13) 
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which is a criterion of contextuality (necessary and sufficient condition for it). 
Moreover, it immediately leads to a natural measure of contextuality: 

cntx(lH) = ^ maxeq (IHg) — maxeq (fH). (14) 

qGQ 

Written in extenso using aird m. 


cntx (fH) = 


y max 

all possible 
couplings Tgfor IHg 


eq {Tq) 


max 

all couplings 
S for 


(*^9): 

qeQ 


(15) 


where, one should recall, Sq is the marginal of S that forms a coupliirg for 91q. 
We can see that the minuend and subtraheird in the definition of cntx (91) differ 
in order of the operations max and while in the miiruend the choice 

of couplings Tq for 91g is uncoirstrained, iir the subtrahend the choice of couplings 
Sq for 91q is coirstrained by the requirement that it is a marginal of the coupling 
for the entire system 91. 


3 The history of the contextuality-by-default approach 


A systematic realization of the idea of contextually labeling a system of mea¬ 
surements 91, considering all possible couplings S for it, and characterizing it 
by the marginals Sq that form couplings for the connections 91g of the system 
was developing through a series of publications EH7]. The idea of maximally 
connected couplings as the central concept for contextuality in consistently con¬ 
nected systems was proposed in Refs. EiiHiin] and then generalized to inconsis¬ 
tently connected systems [iiiini. 

In the latter two references the measure of contextuality (ITT)) and the criterion 
of contextuality m were defined and computed for simple QM systems (cyclic 
systems of rank 3 and 4, as defined below). Later we added to this list cyclic 
systems of rank 5, and formulated a conjecture for the measure and criterion 
formulas for cyclic systems of arbitrary rank |12| . 

In Refs. [T^IITB] the contextuality-by-default theory is presented in its current 
form. The conjecture formulated in Ref. m was proved in Ref. nni. 

A cyclic system (with binary measurements) is defined as one involving n 
“objects” (n being called the rank of the system) measured two at a time. 


( 91 , 92 ), ( 92 , 93 ), • ■ ■, {qn-i,qn) , {qn,qi) ■ 


(16) 


For i = 1,..., n, the pair (qi, qn^i) forms the context Ci (© standing for circular 
shift by 1). Each object qi enters in precisely two consecutive contexts, CiQi and 
Ci- Denoting the measurement of qi in context Cj by Rj, we have the system 
represented by bunches R* = and connections 91^ = 

The formula for the measure of contextuality conjectured in Ref. m and 
proved in Ref. [19] is 


cntx (91) = - max 


Sodd ((^i^)©i) : * — 1, • ■ 
0 


(17) 


2 ) 
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The function Sodd is defined for an arbitrary set of argument Xi,... ,Xk as 

Sodd {xi,...,Xk)= max (ixi ±...±Xk) , (18) 

where the maximum is taken over all assignments of + and — signs with an odd 
number of —’s. The criterion of contextuality readily derived from CZD is: the 
system is contextual if ands only if 

n 

Sodd : * = 1,..., n) > (n - 2) + ^ I (i?() - (i?f i) | . (19) 

i=l 

For consistently connected systems, the sum on the right vanishes, and we can 
derive the traditional formulas for Legget-Garg (n = 3), EPR/Bell (n = 4), and 
Klyachko-Can-Binicioglu-Shumovsky-type (KCBS) systems {n = 5). But the 
formula also allows us to deal with the same experimental paradigm when they 
create inconsistently connected systems, due to signaling or contextual biases in 
experimental design (see Refs. for details). 

The contextuality-by-default theory does have precursors in the literature. 
The idea that random variables in different contexts are stochastically unrelated 
was prominently considered in Refs. PlfTWT). Probabilities of the eq-type with 
the contextual labeling of random variables, as defined in ([5]), were introduced 
in Refs. The distinguishing feature of the contextuality-by-default 

theory is the notion of a maximally connected coupling, which in turn is based 
on the idea of comparing maximal couplings for the connections taken in iso¬ 
lation and those extracted as marginals from the couplings of the entire sys¬ 
tem. Contextuality-by-default is a more systematic and more general theory of 
contextuality than those proposed previously, also more readily applicable to 
experimental data muH]. 

4 Conceptual and Terminological Clarifications 

4.1 Contextuality and quantum mechanics (QM) 

The notion of (non)contextuality has its origins in logic [55], but since the pub¬ 
lication of Ref. m it has been widely considered a QM notion. QM indeed pro¬ 
vides the only known to us theoretically justified examples of contextual systems. 
(Non)contextuality per se, however, is a purely probabilisitic concept, squarely 
within the classical, Kolmogorovian probability theory (that includes the notion 
of stochastic unrelatedness and that of couplings) [71|51[TT]. When contextuality 
is present in a QM system, QM is relevant to answering the question of exactly 
how the noncontextuality conditions in the system are violated, but it is not 
relevant to the question of what these conditions are. 

4.2 Contexts and QM observables 

In particular, the “objects” being measured need not be QM observables. They 
may very well be questions asked in a poll of public opinion, and the binary 
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measurements then may be Yes/No answers to these questions. It is especially 
important not to confuse being “measured together” in the defiirition of a context 
with being represeirted by compatible (commutative) observables. Thus, iir the 
theory of contextuality in cyclic systems, n = 4 is exemplified by the EPR-Bell 
paradigm, with Alice’s “objects” (spins) being 51,^3 and Bob’s q 2 ,qi- In each of 
the contexts (gi, 52 ) > • ■ • > (Q 4 ) qi)^ the two objects are compatible in the trivial 
sense: any observable iir Alice’s Hilbert space Ha is compatible with airy ob¬ 
servable in Bob’s Hilbert space Hb because the joint space is the tensor product 
Ha <S)Hb- The case n = 5 is exemplified by the KCBS paradigm, where the spins 
gi,... ,^5 are represented by observables in three-dimensional Hilbert space. In 
each of the five contexts (gi, 52 ), ■. •, ( 95 , 91 ) the observables are compatible in 
the narrow QM sense: they are commuting Hermitian operators. The case n = 3 
is exemplified by the Leggett-Garg paradigm, where three measurements are 
made at three distinct time moments, two measurements at a time. The QM rep¬ 
resentations of the observables in each of the contexts ( 91 , 92 ) > ( 92193 ), ( 93 , 91 ) 
are generally incompatible (noncommuting) operators. In spite of the profound 
differences in the QM structure of these three cyclic systems, their contextually 
analysis is precisely the same mathematically, given by (13 and (na. 

4.3 The meaning of being measured “together” 

It should be clear from the discussion of the Leggett-Garg paradigm that “mea¬ 
suring objects one context at a time” does not necessarily have the meaning 
of chronological simultaneity. Rather one should think of measurements being 
grouped and recorded in accordance with some fixed coupling scheme: if 9 and 
q' belong to the same context C, there is an empirical procedure by which ob¬ 
servations of are paired with observations of . Thus, if the objects being 
measured are tests taken by students, and the measurements are their test scores, 
the tests are grouped into contexts by the student who takes them, however they 
are distributed in time. The grouping of (potential) observations is in essence 
what couplings discussed in Sections 12.31 and 12.41 do for a set of stochastically un¬ 
related random variables, except that these couplings do not provide a uniquely 
(empirically) defined joint distribution. Rather the probabilistic couplings im¬ 
posed on different bunches are part of a purely mathematical procedure that 
generally yields an infinity of different joint distributions. 

4.4 The meaning of a noncontextual description 

In the traditional approach to (non)contextuality, where the measurements are 
labeled by objects but not by contexts, one can define a noncontextual de¬ 
scription as simply a coupling imposed on the system. For instance, in the 
Leggett-Garg paradigm the noncontextual labeling yields three random vari¬ 
ables, i?i, i? 2 , R 3 , with (i?i, R 2 ), (^ 2 ,-^ 3 ), (R 3 , Ri) jointly observed. A noncon¬ 
textual description here is any three-component random variable S = (^i, S' 2 , S 3 ) 
with (S'i,S' 2 ) ~ (i?i,R 2 ), { 82 , 83 ) ~ {R 2 ,R 3 ), and (S' 3 ,<S'i) ~ (i? 3 ,Ri). The sys¬ 
tem is contextual if no such description exists. 
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The situation is different with contextually labeled measurements. For the 
Leggett-Garg paradigm we now have six variables grouped into three stochas¬ 
tically unrelated contexts, , (-^ 21 -^ 3 ) j A® explained in Sec¬ 

tion 12.31 such a system always has a coupling, in this case a sextuple S with 
( 51 , 51 ) (SlSi) ^ and (51,5?) ^ (i?i,i??). One can 

call any of these couplings a noncontextual description of the system. To charac¬ 
terize (non)contextuality then one can use the term “maximally noncontextual 
description” for any maximally connected coupling m- Alternatively, one can 
confine the term “noncontextual description” of a system only to maximally con¬ 
nected couplings for it. With this terminology the definition of a contextual 
system in our theory is the same as in the traditional approach: a system is con¬ 
textual if it does not have a noncontextual description. The choice between the 
two terminological variants will be ultimately determined by whether couplings 
other than maximally connected ones will be found a useful role to play. 


5 Instead of a Conclusion: Generalizations 

5.1 Beyond objects and subsets 

Defining a context as a subset of objects measured together HMH] is less general 
than defining it by conditions under which certain objects are measured duni. 
For instance, by the first of these definitions {gi, 52 } for a given pair of objects 
is a single context, while the second definition allows one to speak of the same 
pair of objects qi,q 2 forming several different contexts. Thus, if (Zii<Z 2 Sure two 
tests, they can be given in one order or the other, (qi, 52 ) or {qi, 52 )- In fact, in 
all our previous discussion of cyclic systems we used the notation for ordered 
pairs, (qi, gi 0 i) rather than {qi, giQi}. This is inconsequential for cyclic systems 
of rank n > 3. For n = 2, however, the difference between ( 91 , 92 ) and ( 92 , 91 ) is 
critical if n = 2 is to be a nontrivial system (with the distributions of the two 
bunches not identical). It can be shown that the system can be nontrivial, and 
n = 2 is a legitimate value for (fT71) and (fTOl) . 

Being formal and mathematically rigorous here makes things simpler. A con¬ 
text is merely a label (say, superscript) at a random variable with the conven¬ 
tion that identically superscripted variables are “bunched together,” i.e., they 
are jointly distributed. An object is merely another label (in our notation, a 
subscript) that makes all the elements of a bunch different and indicates which 
elements from different bunches should be put together to form a a connection. 
So if there are six random variables grouped into three distinct bunches (i?(, R^), 
(R\, R 2 ), and (i?f, R 2 ) and into two connections [RI,RI, Rf) and (i? 2 , i? 2 , i??) 1 
we can (but do not have to) interpret this as three different contexts involv¬ 
ing the same two objects. From mathematical (and perhaps also philosophical) 
point of view, measurements grouped into bunches and connections are more 
fundamental than objects being measured within contexts. 
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5.2 Beyond binary measnrements 

How could the definition of a maximally connected coupling be generalized to 
arbitrary random variables? A straightforward way to do this is to extend defi¬ 
nition ([5]) for a coupling Tq of a connection IHg as 


eq {Tq) = Pr 



for any two C^C' 


( 20 ) 


This is an approach adopted in [IIIIIIH]. It is based on the following math¬ 
ematical considerations, derived from the discussion of maximal couplings in 
Thorisson’s monograph [M] (Section 7 of Chapter 3). 

Given two sigma-additive measures p and v on the same sigma algebra, let 
us write p < z/ if /i (E) < v {E) for every measurable set E. Let /i^ be the 
probability measure associated with . Let Xq and Sq be the set of values 
and sigma algebra associated with (they are assumed the same for all C, 
because otherwise one should not consider R^ measurements of one and the 
same object). For every object g, define pg as the largest sigma-additive measure 
such that pg < for all contexts C. The measure fiq is the largest in the sense 
that fiq < fj,q for any other measure /ig such that /ig < for all contexts 
C. A theorem proved in Ref. [M] (Theorem 7.1) guarantees the existence and 
uniqueness of /ig, for any set of probability measures whatever the 

indexing set That is, pg is uniquely defined for any connection IHg. Note that 
fiq is not generally a probability measure, so pg (Xq) can be any number in [0,1]. 
Let us denote 

maxeq (IHg) = pg (Xg) . (21) 

For ± 1-measurements R^ this definition specializes to ([S])-®. 

Consider now a coupling Tq for Dig. It is defined on the product sigma-algebra 
^cg’Eq on the product set Y\cgXq. An event Eq G is called a coupling 

event if Sq G Eq implies Tq = Tq for any two C,C' G^ (assuming, as always, 
that both C and C involve q). It follows from Theorem 7.2 in Ref. [Mj that 


Pr [Tg e Eq] < maxeq (91g), 


( 22 ) 


for any q and any choice of Eq. Now, it is natural to define a maximal coupling 
for IHg as a coupling Tq for which Eq can be chosen so that 

Pr [Tg G Eq] = maxeq (Dig). (23) 

Theorem 7.3 in Ref. [23] says that such a maximal coupling always exists. Note 
that Eq in a maximal coupling can always be thought of as the largest measurable 
subset of the diagonal of the set 

Having established this generalized notion of a maximal coupling, the theory 
of contextuality can now be generalized in a straightforward fashion. Consider 
a coupling S for the entire system 91. The definition of a maximally connected 
coupling remains unchanged: every marginal Sq of a maximally connected cou¬ 
pling 5" is a maximal coupling for the corresponding connection 9ig. Our Main 
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Definition could remain unchanged too: a system is contextual if and only if 
no coupling S of this system is maximally connected. This can be equivalently 
presented as follows. For any set P of probability values, let f{P) be a bounded 
smooth nonnegative function strictly increasing in all components of P. Thus, 
for finite systems of random variables / can be chosen as a sum or average, as 
in (fTT|) . Define 


max eq {D\) = max / (Pr [Sq G Eg] : q £ Q) , (24) 

all couplings 
S for m 

and, if this value exists, 

cntx (94) = / (maxeq (94,) : q € Q) — maxeq (94). (25) 

The system is defined as contextual if and only if cntx (94) > 0. We do not know 
whether maxeq (94) exists for all possible systems of random variables. If it does 
not, however, the definition can be extended by replacing max with sup. 

This generalization has to be further explored to determine whether it is a 
good generalization, i.e., whether it provides valuable insights, leads to inter¬ 
esting mathematical developments, and does not yield non-interpretable results 
when applied to specific systems of measurements. 
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